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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: Falb, Dean 

(ii) TITLE OF THE INVENTION: COMPOSITIONS AND METHODS FOR 

THE TREATMENT AND DIAGNOSIS OF 
CARDIOVASCULAR DISEASE 

(iii) NUMBER OF SEQUENCES: 44 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS LLP 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/799,910 

(B) FILING DATE: 13-FEB-1997 

( C ) CL AS S I F I CAT I ON : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/011,787 

(B) FILING DATE: 16-FEB-1996 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE/DOCKET NUMBER: 7853-067-999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212)7909090 

(B) TELEFAX: (212)8699741 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : both 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 162 1871 

(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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V 



GGCACGAGTC GGAGCCGGGC GGAGGGGAGG GGGGAAAGAG GAGCGCAGGG TGAGAGTGAG 60 
CCGCAGGCTT CGGGAGGCGA GGGGGCGGGG GGAGCAGCGC CGAGGYCGCC GCCTCCGCCT 120 
CCGCCGCCTA GGACTAGGGG GTGGGGGACG GACAAGCCCC G ATG CCG GGG GAG ACG 176 

Met Pro Gly Glu Thr 
1 5 

GAA GAG CCG AGA CCC CCG GAG CAG CAG GAC CAG GAA GGG GGA GAG GCG 224 
Glu Glu Pro Arg Pro Pro Glu Gin Gin Asp Gin Glu Gly Gly Glu Ala 
10 15 20 

GCC AAG GCG GCT CCG GAG GAG CCC CAA CAA CGG CCC CCT GAG GCG GTC 272 
Ala Lys Ala Ala Pro Glu Glu Pro Gin Gin Arg Pro Pro Glu Ala Val 
25 30 35 

GCG GCG GCG CCT GCA GGG ACC ACT AGC AGC CGC GTG CTG AGG GGA GGT 320 
Ala Ala Ala Pro Ala Gly Thr Thr Ser Ser Arg Val Leu Arg Gly Gly 
40 45 50 

CGG GAC CGA GGC CGG GCC GCT GCG GCC GCC GCC GCC GCA GCT GTG TCC 368 
Arg Asp Arg Gly Arg Ala Ala Ala Ala Ala Ala Ala Ala Ala Val Ser 
55 60 65 

CGC CGG AGG AAG GCC GAG TAT CCC CGC CGG CGG AGG AGC AGC CCC AGC 416 
Arg Arg Arg Lys Ala Glu Tyr Pro Arg Arg Arg Arg Ser Ser Pro Ser 
70 75 80 85 

GCC AGG CCT CCC GAC GTC CCC GGG CAG CAG CCC CAG GCC GCG AAG TCC 464 
Ala Arq Pro Pro Asp Val Pro Gly Gin Gin Pro Gin Ala Ala Lys Ser 
90 95 100 

CCG TCT CCA GTT CAG GGC AAG AAG AGT CCG CGA CTC CTA TGC ATA GAA 512 
Pro Ser Pro Val Gin Gly Lys Lys Ser Pro Arg Leu Leu Cys lie Glu 
105 HO H5 

AAA GTA ACA ACT GAT AAA GAT CCC AAG GAA GAA AAA GAG GAA GAA GAC 560 
Lys Val Thr Thr Asp Lys Asp Pro Lys Glu Glu Lys Glu Glu Glu Asp 
120 125 130 

GAT TCT GCC CTC CCT CAG GAA GTT TCC ATT GCT GCA TCT AGA CCT AGC 608 
Asp Ser Ala Leu Pro Gin Glu Val Ser He Ala Ala Ser Arg Pro Ser 
135 140 145 

CGG GGC TGG CGT AGT AGT AGG ACA TCT GTT TCT CGC CAT CGT GAT ACA 656 
Ara Gly Trp Arg Ser Ser Arg Thr Ser Val Ser Arg His Arg Asp Thr 
150 155 160 165 

GAG AAC ACC CGA AGC TCT CGG TCC AAG ACC GGT TCA TTG CAG CTC ATT 704 
Glu Asn Thr Arg Ser Ser Arg Ser Lys Thr Gly Ser Leu Gin Leu He 
170 175 180 

TGC AAG TCA GAA CCA AAT ACA GAC CAA CTT GAT TAT GAT GTT GGA GAA 752 
Cys Lys Ser Glu Pro Asn Thr Asp Gin Leu Asp Tyr Asp Val Gly Glu 
185 190 195 

GAG CAT CAG TCT CCA GGT GGC ATT AGT GGT GAA GAG GAA GAG GAG GAG 800 
Glu His Gin Ser Pro Gly Gly He Ser Gly Glu Glu Glu Glu Glu Glu 
200 205 210 

GAA GAA GAG ATG TTA ATC AGT GAA GAG GAG ATA CCA TTC AAA GAT GAT 848 
Glu Glu Glu Met Leu He Ser Glu Glu Glu He Pro Phe Lys Asp Asp 
215 220 225 

CCA AGA GAT GAG ACC TAC AAA CCC CAC TTA GAA AGG GAA ACC CCA AAG 896 
Pro Ara Asp Glu Thr Tyr Lys Pro His Leu Glu Arg Glu Thr Pro Lys 
230 235 240 245 

CCA CGG AGA AAA TCA GGG AAG GTA AAA GAA GAG AAG GAG AAG AAG GAA 944 
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Pro Arg Arg Lys Ser Gly Lys Val Lys Glu Glu Lys Glu Lys Lys Glu 
250 255 260 

ATT AAA GTG GAA GTA GAG GTG GAG GTG AAA GAA GAG GAG AAT GAA ATT 992 

lie Lys Val Glu Val Glu Val Glu Val Lys Glu Glu Glu Asn Glu lie 
265 270 275 

AGA GAG GAT GAG GAA CCT CCA AGG AAG AGA GGA AGA AGA CGA AAA GAT 1040 

Arg Glu Asp Glu Glu Pro Pro Arg Lys Arg Gly Arg Arg Arg Lys Asp 

280 285 290 

GAC AAA AGT CCA CGT TTA CCC AAA AGG AGA AAA AAG CCT CCA ATC CAG 1088 

Asp Lys Ser Pro Arg Leu Pro Lys Arg Arg Lys Lys Pro Pro lie Gin 
295 300 305 

TAT GTC CGT TGT GAG ATG GAA GGA TGT GGA ACT GTC CTT GCC CAT CCT 1136 

Tyr Val Arg Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala His Pro 
310 315 320 325 

CGC TAT TTG CAG CAC CAC ATT AAA TAC CAG CAT TTG CTG AAG AAG AAA 1184 

Arg Tyr Leu Gin His His lie Lys Tyr Gin His Leu Leu Lys Lys Lys 
330 335 340 

TAT GTA TGT CCC CAT CCC TCC TGT GGA CGA CTC TTC AGG CTT CAG AAG 1232 

Tyr Val Cys Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu Gin Lys 
345 350 355 

CAA CTT CTG CGA CAT GCC AAA CAT CAT ACA GAT CAA AGG GAT TAT ATC 1280 

Gin Leu Leu Arg His Ala Lys His His Thr Asp Gin Arg Asp Tyr lie 

360 365 370 

TGT GAA TAT TGT GCT CGG GCC TTC AAG AGT TCC CAC AAT CTG GCA GTG 1328 

Cys Glu Tyr Cys Ala Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val 
375 380 385 

CAC CGG ATG ATT CAC ACT GGC GAG AAG CCA TTA CAA TGT GAG ATC TGT 1376 

His Arg Met lie His Thr Gly Glu Lys Pro Leu Gin Cys Glu lie Cys 
390 395 400 405 

GGA TTT ACT TGT CGA CAA AAG GCA TCT CTT AAT TGG CAC ATG AAG AAA 1424 

Gly Phe Thr Cys Arg Gin Lys Ala Ser Leu Asn Trp His Met Lys Lys 
410 415 420 

CAT GAT GCA GAC TCC TTC TAC CAG TTT TCT TGC AAT ATC TGT GGC AAA 1472 

His Asp Ala Asp Ser Phe Tyr Gin Phe Ser Cys Asn He Cys Gly Lys 
425 430 435 

AAA TTT GAG AAG AAG GAC AGC GTA GTG GCA CAC AAG GCA AAA AGC CAC 1520 

Lys Phe Glu Lys Lys Asp Ser Val Val Ala His Lys Ala Lys Ser His 

440 445 450 

CCT GAG GTG CTG ATT GCA GAA GCT CTG GCT GCC AAT GCA GGC GCC CTC 1568 

Pro Glu Val Leu He Ala Glu Ala Leu Ala Ala Asn Ala Gly Ala Leu 
455 460 465 

ATC ACC AGC ACA GAT ATC TTG GGC ACT AAC CCA GAG TCC CTG ACG CAG 1616 

He Thr Ser Thr Asp lie Leu Gly Thr Asn Pro Glu Ser Leu Thr Gin 
470 475 480 485 

CCT TCA GAT GGT CAG GGT CTT CCT CTT CTT CCT GAG CCC TTG GGA AAC 1664 

Pro Ser Asp Gly Gin Gly Leu Pro Leu Leu Pro Glu Pro Leu Gly Asn 
490 495 500 

TCA ACC TCT GGA GAG TGC CTA CTG TTA GAA GCT GAA GGG ATG TCA AAG 1712 

Ser Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser Lys 
505 510 515 
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TCA TAC TGC AGT GGG ACG GAA CGG GTG AGC CTG ATG GOT GAT GGG AAG 1760 
Ser Tyr Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys 
520 525 530 

ATC TTT GTG GGA AGC GGC AGC AGT GGA GGC ACT GAA GGG CTG GTT ATG 1808 
He Phe Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met 
535 540 545 

AAC TCA GAT ATA CTC GGT GCT ACC ACA GAG GTT CTG ATT GAA GAT TCA 1856 
Asn Ser Asp He Leu Gly Ala Thr Thr Glu Val Leu lie Glu Asp Ser 
550 555 560 565 

GAC TCT GCC GGA CCT TAGTGGACAG GAAGACTTGG GGCATGGGAC AGCT C AG ACT T 1912 
Asp Ser Ala Gly Pro 
570 

TGTATTTAAA AGTTAAAAAG GACAAAAAAA AAAAAAAAAA A 1953 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO: 2 


I ♦ 










Met 


Pro 


Gly 


Glu 


Thr 


Glu 


Glu 


Pro 


Arg 


Pro 


Pro 


Glu 


Gin 


Gin 


Asp 


Gin 


1 






5 










10 










15 




Glu 


Gly 


Gly 


Glu 


Ala 


Ala 


Lys 


Ala 


Ala 


Pro 


Glu 


Glu 


Pro 


Gin 


Gin 


Arg 




20 










25 










30 






Pro 


Pro 


Glu 


Ala 


Val 


Ala 


Ala 


Ala 


Pro 


Ala 


Gly 


Thr 


Thr 


Ser 


Ser 


Arg 




35 










40 










45 






Ala 


Val 


Leu 


Arg 


Gly 


Gly 


Arg 


Asp 


Arg 


Gly 


Arg 


Ala 


Ala 


Ala 


Ala 


Ala 




50 








55 










60 










Ala 


Ala 


Ala 


Val 


Ser 


Arg 


Arg 


Arg 


Lys 


Ala 


Glu 


Tyr 


Pro 


Arg 


Arg 


Arg 


65 










70 










75 








Gin 


80 


Arg 


Ser 


Ser 


Pro 


Ser 


Ala 


Arg 


Pro 


Pro 


Asp 


Val 


Pro 


Gly 


Gin 


Pro 








85 










90 










95 




Gin 


Ala 


Ala 


Lys 


Ser 


Pro 


Ser 


Pro 


Val 


Gin 


Gly 


Lys 


Lys 


Ser 


Pro 


Arg 








100 










105 










110 




Glu 


Leu 


Leu 


Cys 


He 


Glu 


Lys 


Val 


Thr 


Thr 


Asp 


Lys 


Asp 


Pro 


Lys 


Glu 






115 








120 










125 






Ala 


Lys 


Glu 


Glu 


Glu 


Asp 


Asp 


Ser 


Ala 


Leu 


Pro 


Gin 


Glu 


Val 


Ser 


He 


130 










135 










140 






Val 




Ala 


Ser 


Arg 


Pro 


Ser 


Arg 


Gly 


Trp 


Arg 


Ser 


Ser 


Arg 


Thr 


Ser 


Ser 


145 








150 










155 








Thr 


160 


Arg 


His 


Arg 


Asp 


Thr 


Glu 


Asn 


Thr 


Arg 


Ser 


Ser 


Arg 


Ser 


Lys 


Gly 




165 










170 










175 




Ser 


Leu 


Gin 


Leu 


He 


Cys 


Lys 


Ser 


Glu 


Pro 


Asn 


Thr 


Asp 


Gin 


Leu 


Asp 








180 






185 










190 






Tyr 


Asp 


Val 


Gly 


Glu 


Glu 


His 


Gin 


Ser 


Pro 


Gly 


Gly 


He 


Ser 


Gly Glu 


195 








200 










205 






He 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Met 


Leu 


He 


Ser 


Glu 


Glu 


Glu 


210 










215 










220 








Glu 


Pro 


Phe 


Lys 


Asp 


Asp 


Pro 


Arg 


Asp 


Glu 


Thr 


Tyr 


Lys 


Pro 


His 


Leu 


225 




230 










235 








Glu 


240 


Arg 


Glu 


Thr 


Pro 


Lys 


Pro 


Arg 


Arg 


Lys 


Ser 


Gly 


Lys 


Val 


Lys 


Glu 








245 








250 










255 


Glu 


Lys 


Glu 


Lys 


Lys 


Glu 


He 


Lys 


Val 


Glu 


Val 


Glu 


Val 


Glu 


Val 


Lys 




260 










265 










270 




Gly 


Glu 


Glu 


Asn 


Glu 


He 


Arg 


Glu 


Asp 


Glu 


Glu 


Pro 


Pro 


Arg 


Lys 


Arg 






275 








280 










285 
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Arg 


Arg 




Lvs 


Asp 


Asp 


Lys 


Ser 


Pro 


Arg 


Leu 


Pro 


Lys 


Arg 


Arg 


Lys 


290 








295 










300 








Thr 


Ly s 


Pro 


Pro 


lie 


Gin 


Tvr 


Val 


Arg 


Cys 


Glu 


Met 


Glu 


Gly 


Cys 


Gly 


305 










310 










315 










320 


Val 


Leu 


Ala 


His 


Pro 


Ara 


Tyr 


Leu 


Gin 


His 


His 


He 


Lys 


Tyr 


Gin 


His 










325 






330 










335 




Levi 


Leu 


Lys 


Lvs 


Lvs 


Tvr 


Val 


Cys 


Pro 


His 


Pro 


Ser 


Cys 


Gly 


Arg 


Leu 






340 








345 










350 






Phe 


Arg 


Leu 


Gin 


Lvs 


Gin 


Leu 


Leu 


Arg 


His 


Ala 


Lys 


His 


His 


Thr 


Asp 




355 








360 










365 








Gin 


Arg 


Aso 


Tvr 


He 


Cvs 


Glu 


Tyr 


Cys 


Ala 


Arg 


Ala 


Phe 


Lys 


Ser 


Ser 




370 






375 










380 










His 


Asn 


Leu 


Ala 


Val 


His 


Arg 


Met 


He 


His 


Thr 


Gly 


Glu 


Lys 


Pro 


Leu 


385 










390 








395 










400 


Gin 


Cy s 


Glu 


He 


Cvs 


Gly 


Phe 


Thr 


Cys 


Arg 


Gin 


Lys 


Ala 


Ser 


Leu 


Asn 








405 










410 










415 






His 


Met 


Lvs 


LVS 


His 


Asp 


Ala 


Asp 


Ser 


Phe 


Tyr 


Gin 


Phe 


Ser 


Cys 






420 








425 










430 




His 


Asn 


lie 


Cys 


Glv 


LVS 


LVS 


Phe 


Glu 


Lys 


Lys 


Asp 


Ser 


Val 


Val 


Ala 






435 








440 










445 








Ly s 


Ala 


Lys 


Ser 


His 


Pro 


Glu 


Val 


Leu 


He 


Ala 


Glu 


Ala 


Leu 


Ala 


Ala 


450 








455 










460 










Asn 


Ala 


Glv 


Ala 


Leu 


He 


Thr 


Ser 


Thr 


Asp 


He 


Leu 


Gly 


Thr 


Asn 


Pro 


465 








470 










475 










480 


Glu 


Ser 


Leu 


Thr 


Gin 


Pro 


Ser 


Asp 


Gly 


Gin 


Gly 


Leu 


Pro 


Leu 


Leu 


Pro 








485 








490 










495 


Ala 


Glu 


Pro 


Leu 


Gly 
500 


Asn 


Ser 


Thr 


Ser 


Gly 
505 


Glu 


Cys 


Leu 


Leu 


Leu 
510 


Glu 


Glu 


Glv 


Met 


Ser 


Lys 


Ser 


Tyr 


Cys 


Ser 


Gly 


Thr 


Glu 


Arg 


Val 


Ser 


Leu 




515 








520 










525 








Met 


Ala 


Asp 


Gly 


Lys 


He 


Phe 


Val 


Gly 


Ser 


Gly 


Ser 


Ser 


Gly 


Gly 


Thr 




530 






535 










540 










Glu 


Gly 


Leu 


Val 


Met 


Asn 


Ser 


Asp 


He 


Leu 


Gly 


Ala 


Thr 


Thr 


Glu 


Val 


545 








550 










555 










560 


Leu 


lie 


Glu 


Asp 


Ser 


Asp 


Ser 


Ala 


Gly 


Pro 




















565 










570 















(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 288... 1565 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCACGAGCGG AGAGCCGCGC AGGGCGCGGG CCGCGCGGGG TGGGGCAGCC GGAGCGCAGG 60 
CCCCCGATCC CCGGCGGGCG CCCCCGGGCC CCCGCGCGCG CCCCGGCCTC CGGGAGACTG 120 
GCGCATGCCA CGGAGCGCCC CTCGGGCCGC CGCCGCTCCT GCCCGGGCCC CTGCTGCTGC 180 
TGCTGTCGCC TGCGCCTGCT GCCCCAACTC GGCGCCCGAC TCACAAAGAA ACATCATGTT 240 
CGCTCCTTAG CAGGCAAACG ACTTTTCTCC TCGCCTCCTC GCCCCGC ATG TTC AGG 296 

Met Phe Arg 
1 

ACC AAA CGA TCT GCG CTC GTC CGG CGT CTC TGG AGG AGC CGT GCG CCC 344 
Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp Arg Ser Arg Ala Pro 
5 10 15 
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GGC GGC GAG GAC GAG GAG GAG GGC GCA GGG GGA GGT GGA GGA GGA GGC 392 
Glv Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly 
20 25 30 35 

GAG CTG CGG GGA GAA GGG GCG ACG GAC AGC CGA GCG CAT GGG GCC GGT 440 
Glu Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly 
40 45 50 

GGC GGC GGC CCG GGC AGG GCT GGA TGC TGC CTG GGC AAG GCG GTG CGA 488 
Gly Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg 
55 60 65 

GGT GCC AAA GGT CAC CAC CAT CCC CAC CCG CCA GCC GCG GGC GCC GGC 536 
Glv Ala Lys Gly His His His Pro His Pro Pro Ala Ala Gly Ala Gly 
70 75 80 

GCG GCC GGG GGC GCC GAG GCG GAT CTG AAG GCG CTC ACG CAC TCG GTG 584 
Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val 
85 90 95 

CTC AAG AAA CTG AAG GAG CGG CAG CTG GAG CTG CTG CTC CAG GCC GTG 632 
Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu Leu Leu Gin Ala Val 
100 105 HO H5 

GAG TCC CGC GGC GGG ACG CGC ACC GCG TGC CTC CTG CTG CCC GGC CGC 680 
Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly Arg 
120 125 130 

CTG GAC TGC AGG CTG GGC CCG GGG GCG CCC GCC GGC GCG CAG CCT GCG 728 
Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala Gin Pro Ala 
135 140 145 

CAG CCG CCC TCG TCC TAC TCG CTC CCC CTC CTG CTG TGC AAA GTG TTC 776 
Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu Cys Lys Val Phe 
150 155 160 

AGG TGG CCG GAT CTC AGG CAT TCC TCG GAA GTC AAG AGG CTG TGT TGC 824 
Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val Lys Arg Leu Cys Cys 
165 170 175 

TGT GAA TCT TAC GGG AAG ATC AAC CCC GAG CTG GTG TGC TGC AAC CCC 872 
Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu Val Cys Cys Asn Pro 
180 185 190 195 

CAT CAC CTT AGC CGA CTC TGC GAA CTA GAG TCT CCC CCC CCT CCT TAC 920 
His His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr 
200 205 210 

TCC AGA TAC CCG ATG GAT TTT CTC AAA CCA ACT GCA GAC TGT CCA GAT 968 
Ser Arq Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp 
215 220 225 

GCT GTG CCT TCC TCC GCT GAA ACA GGG GGA ACG AAT TAT CTG GCC CCT 1016 
Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro 
230 235 240 

GGG GGG CTT TCA GAT TCC CAA CTT CTT CTG GAG CCT GGG GAT CGG TCA 1064 
Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu Pro Gly Asp Arg Ser 
245 250 255 

CAC TGG TGC GTG GTG GCA TAC TGG GAG GAG AAG ACG AGA GTG GGG AGG 1112 
His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg Val Gly Arg 
260 265 270 275 

CTC TAC TGT GTC CAG GAG CCC TCT CTG GAT ATC TTC TAT GAT CTA CCT 1160 
Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp II Phe Tyr Asp Leu Pro 
280 285 290 
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CAG GGG AAT GGC TTT TGC CTC GGA CAG CTC AAT TCG GAC AAC AAG AGT f 
Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn S r Asp Asn Lys Ser 
295 300 305 

CAG CTG GTG CAG AAG GTG CGG AGC AAA ATC GGC TGC GGC ATC CAG CTG 
Gin Leu Val Gin Lys Val Arg Ser Lys lie Gly Cys Gly He Gin Leu 
310 315 320 

ACG CGG GAG GTG GAT GGT GTG TGG GTG TAC AAC CGC AGC AGT TAC CCC 
Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro 
325 330 335 

ATC TTC ATC AAG TCC GCC ACA CTG GAC AAC CCG GAC TCC AGG ACG CTG 
He Phe He Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu 
340 345 350 355 

TTG GTA CAC AAG GTG TTC CCC GGT TTC TCC ATC AAG GCT TTC GAC TAC 
Leu Val His Lys Val Phe Pro Gly Phe Ser He Lys Ala Phe Asp Tyr 
360 365 370 

GAG AAG GCG TAC AGC CTG CAG CGG CCC AAT GAC CAC GAG TTT ATG CAG 
Glu Lvs Ala Tyr Ser Leu Gin Arg Pro Asn Asp His Glu Phe Met Gin 
375 380 385 

CAG CCG TGG ACG GGC TTT ACC GTG CAG ATC AGC TTT GTG AAG GGC TGG 
Gin Pro Trp Thr Gly Phe Thr Val Gin He Ser Phe Val Lys Gly Trp 
390 395 400 

GGT CAG TGC TAC ACC CGC CAG TTC ATC AGC AGC TGC CCG TGC TGG CTA 
Gly Gin Cys Tyr Thr Arg Gin Phe He Ser Ser Cys Pro Cys Trp Leu 
405 410 415 

GAG GTC ATC TTC AAC AGC CGG TAGCCGCGTG CGGAGGGGAC AGAGCGTGAG CTGA 
Glu Val He Phe Asn Ser Arg 
420 425 



1208 



1256 



1304 



1352 



1400 



1448 



1496 



1544 



1599 



GCAGGCCACA 
TGCAAACTCT 
TTTGTGTTCT 
TTTTTTGGAA 
GAAATCCAAG 
GGAGTGTGTG 
GGGGAGACAA 
CAGTCCCAAG 
TGGGCAAGCT 
GGGAAAGCCC 
AGCAGCGAGC 
TTCCTGCCAG 
CCTGAGATGG 
GCCCTCTCCC 
AGAAAACCAG 
GCAAGCGGGG 
TTGGTCCGTC 
TCTACAACTC 
AGACTTTAAC 
TCATTGGAAA 
AGATTTATTA 
TATAAAGACG 
AATAAAGAAA 
AAGATAAACA 
CTAAAGGAGT 
AAAA 



CTTCAAACTA 
TTGGTCGTTT 
GTTTTGTTTC 
GAAGGGGCAG 
CACCACCAAA 
TGAGTGTGAG 
CGTGCTCTTT 
CGGTGTCTCT 
GGCGGCTGGG 
CTGCCCCTCC 
CTGCTAGTGG 
CCCCTGCCAT 
GCTTCCCAAG 
CTCAGCCTCT 
CTCAGCACTG 
ATGTCCCTGG 
ATAACCCAAG 
ATACACTCGT 
ATAAGCTATT 
TTTAGCATTG 
AAGAAAAAAA 
TTTTATAAGC 
AGATGCACTT 
CAAGATTGGT 
TTATGTTCCA 



CTTTGCTGCT 
TTTTTTTGTT 
GCTCTTTGAG 
GTATGATCGG 
CACAGTGTAT 
TGTGCGGCTG 
GTTTTGTGTC 
CCTGCCCCTT 
GTCCCAGCAG 
TCTCCCTCAT 
CCGAACCAGA 
TGTAGCGTCT 
GGCTGCCGGG 
CCCCTGCCTG 
CCTGCTCCCA 
GAGGGACATG 
GTACCATCCT 
ATGATACTTC 
TTTCTAACTA 
TAGTGCTTTG 
TGTATTTTAT 
ATCATTATTT 
TGCTTTAATA 
GTTTTTTCCT 
TTAAACGATT 



AATATTTTCC 
TGTTGGTTGG 
AAATAGCTTA 
CAGGACACCC 
GAAGGGGGGC 
TGTGTGCACG 
TCTTATGGAT 
GGACACGCTC 
CTGCCAGGAG 
CAAGGACACG 
ACCAATTATT 
TTCTTTTTTG 
GCAGCCCCCT 
CCCTGGTGAC 
TCCTGTGTGT 
CTTAGCAGTC 
AGGCTGACAC 
GACACTGTTC 
CAAAGGTTTA 
AGAGAGAAAG 
GTTATATATA 
ATGTATTGTG 
TAAATGCAAA 
ATGGGTGTTA 
TTTAAAATGT 



TCCTGAGTGC 
TTTTCTTCTT 
TGAAAAGAAT 
TGATAGGAAG 
GGTCATCATT 
CGTGTGCAGG 
GTCCCCAGCA 
AGTGGGGCAG 
CACGGCTCTG 
GGCCTGTCCA 
TTCATCCTTG 
GCCATCTGCT 
CACAGTATTG 
ATCAGGTTTT 
TAAGCTCTGC 
CCCTTCCCTC 
CTAACTCTTC 
TTAGCTCAAT 
AATGAACAAG 
GACTCCTGAA 
AATATATTAT 
CAATGTGTAT 
TAACAAATGC 
TCACCTAGCT 
ACACTTGAAA 



TTGCTTTTCA 
CTCGTCCTCG 
TGTTGGGGGT 
AGGGGAAGCA 
TCACTTGTCA 
AGCGGCAGAT 
GAGAGGTTTG 
AGGCAGTACC 
TCCCCAGCCT 
CAGGCTTCTG 
TCTTATTCCC 
CCTGGATCTC 
CTCACCCAGT 
TCCCGGACTT 
TATTAGGCCA 
CAAGAAGGAT 
TTTCATTTCT 
GAGCATGTTT 
AGAAGCATTC 
AAAAAACCTG 
TACTTGTAAA 
AAACAAGAAA 
CAAATTAAAA 
GAATGTTTTT 
AAAAAAAAAA 



1659 
1719 
1779 
1839 
1899 
1959 
2019 
2079 
2139 
2199 
2259 
2319 
2379 
2439 
2499 
2559 
2619 
2679 
2739 
2799 
2859 
2919 
2979 
3039 
3099 
3103 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 426 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Phe Arg Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp Arg Ser 

15 10 15 

Arg Ala Pro Gly Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly Gly Gly 

20 25 30 

Gly Gly Gly Glu Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg Ala His 
35 40 45 ; 

Gly Ala Gly Gly Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys 

50 55 60 

Ala Val Arg Gly Ala Lys Gly His His His Pro His Pro Pro Ala Ala 
65 70 75 80 

Gly Ala Gly Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr 

85 90 95 

His Ser Val Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu Leu Leu 

100 105 HO 

Gin Ala Val Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu 

115 120 125 

Pro Gly Arg Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala 

130 135 140 

Gin Pro Ala Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu Cys 
145 150 155 160 

Lys Val Phe Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val Lys Arg 

165 170 175 

Leu Cys Cys Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu Val Cys 

180 185 190 

Cys Asn Pro His His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro 

195 200 205 

Pro Pro Tyr Ser Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp 

210 215 220 

Cys Pro Asp Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr 
225 230 235 240 

Leu Ala Pro Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu Pro Gly 

245 250 255 

Asp Arg Ser His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg 

260 265 270 

Val Gly Arg Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp lie Phe Tyr 

275 280 285 

Asp Leu Pro Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn Ser Asp 

290 295 300 

Asn Lys Ser Gin Leu Val Gin Lys Val Arg Ser Lys He Gly Cys Gly 
305 310 315 320 

He Gin Leu Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser 

325 330 335 

Ser Tyr Pro He Phe He Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser 

340 345 350 

Arg Thr Leu Leu Val His Lys Val Phe Pro Gly Phe Ser He Lys Ala 

355 360 365 

Phe Asp Tyr Glu Lys Ala Tyr Ser Leu Gin Arg Pro Asn Asp His Glu 

370 375 380 

Phe Met Gin Gin Pro Trp Thr Gly Phe Thr Val Gin He Ser Phe Val 
385 390 395 400 

Lvs Glv Trp Gly Gin Cys Tyr Thr Arg Gin Phe He Ser Ser Cys Pro 

405 410 415 

Cys Trp Leu Glu Val He Phe Asn Ser Arg 
420 425 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1393 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 90... 938 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGCACGAGGT TGCCCTGGCG GAGCAGAGAC AGGCCCTCGG GGTGGAGGTC TTTGGTTTCA 60 
TAAGAGCCTG AGAGAGATTT TTCTAAGAT ATG TGT AAC ACA CCA ACG TAC TGT 113 

Met Cys Asn Thr Pro Thr Tyr Cys 
1 5 

GAC CTA GGA AAG GCT GCT AAG GAT GTC TTC AAC AAA GGA TAT GGC TTT 161 
Asp Leu Gly Lys Ala Ala Lys Asp Val Phe Asn Lys Gly Tyr Gly Phe 
10 15 20 

GGC ATG GTC AAG ATA GAC CTG AAA ACC AAG TCT TGT AGT GGA GTG GAA 209 
Gly Met Val Lys lie Asp Leu Lys Thr Lys Ser Cys Ser Gly Val Glu 
25 30 35 40 

TTT TCT ACT TCT GGT CAT GCT TAC ACT GAT ACA GGG AAA GCA TCA GGC 257 
Phe Ser Thr Ser Gly His Ala Tyr Thr Asp Thr Gly Lys Ala Ser Gly 
45 50 55 

AAC CTA GAA ACC AAA TAT AAG GTC TGT AAC TAT GGA CTT ACC TTC ACC 305 
Asn Leu Glu Thr Lys Tyr Lys Val Cys Asn Tyr Gly Leu Thr Phe Thr 
60 65 70 

CAG AAA TGG AAC ACA GAC AAT ACT CTA GGG ACA GAA ATC TCT TGG GAG 353 
Gin Lys Trp Asn Thr Asp Asn Thr Leu Gly Thr Glu lie Ser Trp Glu 
75 80 85 

AAT AAG TTG GCT GAA GGG TTG AAA CTG ACT CTT GAT ACC ATA TTT GTA 401 
Asn Lys Leu Ala Glu Gly Leu Lys Leu Thr Leu Asp Thr lie Phe Val 
90 95 100 

CCG AAC ACA GGA AAG AAG AGT GGG AAA TTG AAG GCC TCC TAT AAA CGG 449 
Pro Asn Thr Gly Lys Lys Ser Gly Lys Leu Lys Ala Ser Tyr Lys Arg 
105 HO H5 120 

GAT TGT TTT AGT GTT GGC AGT AAT GTT GAT ATA GAT TTT TCT GGA CCA 497 
Asp Cys Phe Ser Val Gly Ser Asn Val Asp lie Asp Phe Ser Gly Pro 
125 130 135 

ACC ATC TAT GGC TGG GCT GTG TTG GCC TTC GAA GGG TGG CTT GCT GGC 545 
Thr lie Tyr Gly Trp Ala Val Leu Ala Phe Glu Gly Trp Leu Ala Gly 
140 145 150 

TAT CAG ATG AGT TTT GAC ACA GCC AAA TCC AAA CTG TCA CAG AAT AAT 593 
Tyr Gin Met Ser Phe Asp Thr Ala Lys Ser Lys Leu Ser Gin Asn Asn 
155 160 165 

TTC GCC CTG GGT TAC AAG GCT GCG GAC TTC CAG CTG CAC ACA CAT GTG 641 
Phe Ala Leu Gly Tyr Lys Ala Ala Asp Phe Gin Leu His Thr His Val 
170 175 180 

AAC GAT GGC ACT GAA TTT GGA GGT TCT ATC TAC CAG AAG GTG AAT GAG 689 
Asn Asp Gly Thr Glu Phe Gly Gly Ser He Tyr Gin Lys Val Asn Glu 
185 190 195 200 
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AAG ATT GAA ACA TCC ATA AAC CTT GCT TGG ACA GCT GGG AGT AAC AAC 737 
Lys lie Glu Thr Ser lie Asn Leu Ala Trp Thr Ala Gly Ser Asn Asn 
205 210 215 

ACC CGT TTT GGC ATT GCT GCT AAG TAC ATG CTG GAT TGT AGA ACT TCT 785 
Thr Arg Phe Gly He Ala Ala Lys Tyr Met Leu Asp Cys Arg Thr Ser 
220 225 230 

CTC TCT GCT AAA GTA AAT AAT GCC AGC CTG ATT GGA CTG GGT TAT ACT 833 
Leu Ser Ala Lys Val Asn Asn Ala Ser Leu He Gly Leu Gly Tyr Thr 
235 240 245 

CAG ACC CTT CGA CCA GGA GTC AAA TTG ACT TTA TCA GCT TTA ATC GAT 881 
Gin Thr Leu Arg Pro Gly Val Lys Leu Thr Leu Ser Ala Leu He Asp 
250 255 260 

GGG AAG AAC TTC AGT GCA GGA GGT CAC AAG GTT GGC TTG GGA TTT GAA 929 
Gly Lys Asn Phe Ser Ala Gly Gly His Lys Val Gly Leu Gly Phe Glu 
265 270 275 280 

CTG GAA GCT TAATGTGGTT TGAGGAAAGC ATCAGATTTG TCCCTGGAAG TGAAGAGAA 987 
Leu Glu Ala 

ATGAACCCAC TATGTTTTGG CCTTAAAATT CTTCTGTGAA ATTTCAAAAG TGTGAACTTT 1047 

TTATTCTTCC AAAGAATTGT AATCCTCCCC ACACTGAAGT CTAGGGGTTG CGAATCCCTC 1107 

CTGAGGGAGA CGCTTGAAGG CATGCCTGGA AGTTGTCATG TTTGTGCCAC GTTTCAGTTC 1167 

AGTTCTGAAG TGTTATTAAA TGTGTTCCTC AGCGACAGTG TAGCGTCATG TTAGAGGAGA 1227 

CGATCTGACC CACCAGTTTG TACATCACGT CCTGCATGTC CCACACCATT TTTTCATGAC 1287 

CTTGTAATAT ACTGGTCTCT GTGCTATAGT GGAATCTTTG GTTTTGCATC ATAGTAAAAT 1347 

AAAATAAACC CATCACATTT GGAACATAAA AAAAAAAAAA AAAAAA 1393 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met 


Cys 


Asn 


Thr 


Pro 


Thr 


Tyr 


Cys 


Asp 


Leu 


Gly 


Lys 


Ala 


Ala 


Lys 


Asp 


1 






5 








10 










15 




Val 


Phe 


Asn 


Lys 


Gly 


Tyr 


Gly 


Phe 


Gly 


Met 


Val 


Lys 


He 


Asp 


Leu 


Lys 








20 








25 










30 


Ala 




Thr 


Lys 


Ser 


Cys 


Ser 


Gly 


Val 


Glu 


Phe 


Ser 


Thr 


Ser 


Gly 


His 


Tyr 




35 








40 










45 






Val 


Thr 


Asp 


Thr 


Gly 


Lys 


Ala 


Ser 


Gly 


Asn 


Leu 


Glu 


Thr 


Lys 


Tyr 


Lys 




50 








55 










60 








Thr 


Cys 


Asn 


Tyr 


Gly 


Leu 


Thr 


Phe 


Thr 


Gin 


Lys 


Trp 


Asn 


Thr 


Asp 


Asn 


65 






70 










75 










80 


Leu 


Gly 


Thr 


Glu 


He 


Ser 


Trp 


Glu 


Asn 


Lys 


Leu 


Ala 


Glu 


Gly 


Leu 


Lys 








85 










90 










95 


Gly 


Leu 


Thr 


Leu 


Asp 
100 


Thr 


He 


Phe 


Val 


Pro 
105 


Asn 


Thr 


Gly 


Lys 


Lys 
110 


Ser 


Lys 


Leu 


Lys 


Ala 


Ser 


Tyr 


Lys 


Arg 


Asp 


Cys 


Phe 


Ser 


Val 


Gly 


Ser 


Asn 




115 










120 










125 




Val 




Val 


Asp 


He 


Asp 


Phe 


Ser 


Gly 


Pro 


Thr 


He 


Tyr 


Gly 


Trp 


Ala 


Leu 




130 








135 










140 








Ala 


Ala 


Phe 


Glu 


Gly 


Trp 


L u 


Ala 


Gly 


Tyr 


Gin 


Met 


Ser 


Phe 


Asp 


Thr 


145 






150 










155 








Ala 


160 


Lys 


Ser 


Lys 


Leu 


Ser 


Gin 


Asn 


Asn 


Phe 


Ala 


Leu 


Gly 


Tyr 


Lys 


Ala 






165 










170 










175 
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Asp 


Phe 


Gin 


Leu 


His 


Thr 


His 


Val 


Asn 


Asp 


Gly 


Thr 


Glu 


Phe 


Gly 


Gly 






180 










185 










190 






Ser 


lie 


Tvr 


Gin 


Lvs 


Val 


Asn 


Glu 


Lys 


He 


Glu 


Thr 


Ser 


He 


Asn 


Leu 






195 








200 










205 








Ala 


Tro 


Thr 


Ala 


Gly 


Ser 


Asn 


Asn 


Thr 


Arg 


Phe 


Gly 


He 


Ala 


Ala 


Lys 




210 








215 










220 








Ala 


Tyr 


Met 


Leu 


Asp 


Cys 


Arg 


Thr 


Ser 


Leu 


Ser 


Ala 


Lys 


Val 


Asn 


Asn 


225 








230 










235 








Val 


240 


Ser 


Leu 


He 


Gly 


Leu 


Gly 


Tyr 


Thr 


Gin 


Thr 


Leu 


Arg 


Pro 


Gly 


Lys 








245 








250 










255 


Gly 


Leu 


Thr 


Leu 


Ser 


Ala 


Leu 


He 


Asp 


Gly 


Lys 


Asn 


Phe 


Ser 


Ala 


Gly 






260 










265 










270 






His 


Lys 


Val 


Gly 


Leu 


Gly 


Phe 


Glu 


Leu 


Glu 


Ala 














275 










280 



















(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1...546 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACG AGC CTA GCC CTG GTG CTC AAC CTG CTG CAG ATC CAG AGG AAT GTC 48 
Thr Ser Leu Ala Leu Val Leu Asn Leu Leu Gin He Gin Arg Asn Val 
15 10 15 

ACT CTC TTC CCC GAG GAG GTG ATC GCC ACC ATC TTT TCC TCC GCC TGG 96 
Thr Leu Phe Pro Glu Glu Val He Ala Thr He Phe Ser Ser Ala Trp 
20 25 30 

TGG GTC CCT CCC TGC TGC GGG ACA GCA GCT GCT GTT GTT GGC CTA CTG 144 
Trp Val Pro Pro Cys Cys Gly Thr Ala Ala Ala Val Val Gly Leu Leu 
35 40 45 

TAC CCC TGT ATC GAC AGT CAC CTC GGA GAA CCC CAC AAA TTT AAG AGA 192 
Tyr Pro Cys He Asp Ser His Leu Gly Glu Pro His Lys Phe Lys Arg 
50 55 60 

GAA TGG GCC AGT GTC ATG CGC TGC ATA GCA GTT TTT GTT GGC ATT AAC 240 
Glu Trp Ala Ser Val Met Arg Cys He Ala Val Phe Val Gly He Asn 
65 70 75 80 

CAC GCC AGT GCT AAA TTG GAT TTT GCC AAT AAT GTC CAG CTG TCC TTG 288 
His Ala Ser Ala Lys Leu Asp Phe Ala Asn Asn Val Gin Leu Ser Leu 
85 90 95 

ACT TTA GCA GCC CTA TCT TTG GGC CTT TGG TGG ACA TTT GAT CGT TCC 336 
Thr Leu Ala Ala Leu Ser Leu Gly Leu Trp Trp Thr Phe Asp Arg Ser 
100 105 HO 

AGA AGT GGC CTT GGG CTG GGG ATC ACC ATA GCT TTT CTA GCT ACG CTG 384 
Arq Ser Gly Leu Gly Leu Gly He Thr He Ala Phe Leu Ala Thr Leu 
115 120 125 

ATC ACG CAG TTT CTC GTG TAT AAT GGT GTC TAT CAG TAT ACA TCC CCA 432 
He Thr Gin Phe Leu Val Tyr Asn Gly Val Tyr Gin Tyr Thr Ser Pro 
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1 



130 135 140 

GAT TTC CTC TAT ATT CGT TCT TGG CTC CCT TGT ATA TTT TTC TCA GGA 480 
Asp Phe Leu Tyr lie Arg Ser Trp Leu Pro Cys lie Phe Phe Ser Gly 
145 150 155 160 

GGC GTC ACG GTG GGG AAC ATA GGA CGA CAG TTA GCT ATG GGT GTT CCT 528 
Gly Val Thr Val Gly Asn lie Gly Arg Gin Leu Ala Met Gly Val Pro 
165 170 175 

GAA AAG CCC CAT AGT GAT TGAGTCTTCA AAACCACCGA TTCTGAGAGC AAGGAAGA 584 
Glu Lys Pro His Ser Asp 
180 

TTTTGGAAGA AAATCTGACT GTGGATTATG ACAAAGATTA TCTTTTTTCT TAAGTAATCT 644 

ATTTAGATCG GGCTGACTGT ACAAATGACT CCTGGAAAAA ACTCTTCACC TAGTCTAGAA 704 

TAGGGAGGTG GAGAATGATG ACTTACCCTG AAGTCTTCCC TTGACTGCCC GCACTGGCGC 764 

CTGTCTGTGC CCTGGAGCAT TCTGCCCAGG CTACGTGGGT TCAGGCAGGT GGCAGCTTCC 824 

CAAGTATTCG ATTTCATTCA TGTGATTAAA ACAAGTTGCC ATATTTCAAA AAAAAAAAAA 884 

AAAAMCTCGA GACCAACCCG CAGTTTTGTG TCAGTGCCCA AAGGAGGTAG GTTGATGGTG 944 

CTTAACAAAC ATGAAGTATG GTGTAATAGG AATAATATTT ATCCNAAAGA TTTTTAAAAA 1004 

TAGGGCTGTG TTTAAAAAAA AAAAAAAAAA AA 1036 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO: 8: 










Thr 


Ser 


Leu 


Ala 


Leu 


Val 


Leu 


Asn 


Leu 


Leu 


Gin 


He 


Gin 


Arg 


Asn 


Val 


1 








5 










10 










15 




Thr 


Leu 


Phe 


Pro 


Glu 


Glu 


Val 


He 


Ala 


Thr 


He 


Phe 


Ser 


Ser 


Ala 


Trp 






20 










25 










30 






Trp 


Val 


Pro 


Pro 


Cys 


Cys 


Gly 


Thr 


Ala 


Ala 


Ala 


Val 


Val 


Gly 


Leu 


Leu 




35 








40 










45 








Tyr 


Pro 


Cys 


He 


Asp 


Ser 


His 


Leu 


Gly 


Glu 


Pro 


His 


Lys 


Phe 


Lys 


Arg 


50 








55 










60 










Glu 


Trp 


Ala 


Ser 


Val 


Met 


Arg 


Cys 


He 


Ala 


Val 


Phe 


Val 


Gly 


He 


Asn 


65 








70 










75 










80 


His 


Ala 


Ser 


Ala 


Lys 


Leu 


Asp 


Phe 


Ala 


Asn 


Asn 


Val 


Gin 


Leu 


Ser 


Leu 










85 










90 










95 




Thr 


Leu 


Ala 


Ala 


Leu 


Ser 


Leu 


Gly 


Leu 


Trp 


Trp 


Thr 


Phe 


Asp 


Arg 


Ser 








100 










105 










110 






Arg 


Ser 


Gly 


Leu 


Gly 


Leu 


Gly 


He 


Thr 


He 


Ala 


Phe 


Leu 


Ala 


Thr 


Leu 




115 










120 










125 








lie 


Thr 


Gin 


Phe 


Leu 


Val 


Tyr 


Asn 


Gly 


Val 


Tyr 


Gin 


Tyr 


Thr 


Ser 


Pro 




130 










135 










140 








Gly 


Asp 


Phe 


Leu 


Tyr 


He 


Arg 


Ser 


Trp 


Leu 


Pro 


Cys 


He 


Phe 


Phe 


Ser 


145 








150 










155 








Val 


160 


Gly 


Val 


Thr 


Val 


Gly 


Asn 


He 


Gly 


Arg 


Gin 


Leu 


Ala 


Met 


Gly 


Pro 








165 










170 










175 




Glu 


Lys 


Pro 


His 


Ser 


Asp 


























180 



























(2) INFORMATION FOR SEQ ID NO: 9: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1228 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : both 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 468 

(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG TGT CAC TCT CGC AGC TGC CAC CCG ACC ATG ACC ATC CTG CAG GCC 48 
Met Cys His Ser Arg Ser Cys His Pro Thr Met Thr lie Leu Gin Ala 
1 5 10 15 

CCG ACC CCG GCC CCC TCC ACC ATC CCG GGA CCC CGG CGG GGC TCC GGT 96 
Pro Thr Pro Ala Pro Ser Thr lie Pro Gly Pro Arg Arg Gly Ser Gly 
20 25 30 

CCT GAG ATC TTC ACC TTC GAC CCT CTC CCG GAG CCC GCA GCG GCC CCT 144 
Pro Glu lie Phe Thr Phe Asp Pro Leu Pro Glu Pro Ala Ala Ala Pro 
35 40 45 

GCC GGG CGC CCC AGC GCC TCT CGC GGG CAC CGA AAG CGC AGC CGC AGG 192 
Ala Gly Arg Pro Ser Ala Ser Arg Gly His Arg Lys Arg Ser Arg Arg 
50 55 60 

GTT CTC TAC CCT CGA GTG GTC CGG CGC CAG CTG CCA GTC GAG GAA CCG 240 
Val Leu Tyr Pro Arg Val Val Arg Arg Gin Leu Pro Val Glu Glu Pro 
65 70 75 80 

AAC CCA GCC AAA AGG CTT CTC TTT CTG CTG CTC ACC ATC GTC TTC TGC 288 
Asn Pro Ala Lys Arg Leu Leu Phe Leu Leu Leu Thr He Val Phe Cys 
85 90 95 

CAG ATC CTG ATG GCT GAA GAG GGT GTG CCG GCG CCC CTG CCT CCA GAG 336 
Gin He Leu Met Ala Glu Glu Gly Val Pro Ala Pro Leu Pro Pro Glu 
100 105 HO 

GAC GCC CCT AAC GCC GCA TCC CTG GCG CCC ACC CCT GTG TCC CCC GTC 384 
Asp Ala Pro Asn Ala Ala Ser Leu Ala Pro Thr Pro Val Ser Pro Val 
115 120 125 

CTC GAG CCC TTT AAT CTG ACT TCG GAG CCC TCG GAC TAC GCT CTG GAC 432 
Leu Glu Pro Phe Asn Leu Thr Ser Glu Pro Ser Asp Tyr Ala Leu Asp 
130 135 140 

CTC AGC ACT TTC CTC CAG CAA CAC CCG GCC GCC TTC TAACTGTGAC TCCCCG 484 
Leu Ser Thr Phe Leu Gin Gin His Pro Ala Ala Phe 
145 150 155 

CACTCCCCAA AAAGAATCCG AAAAACCACA AAGAAACACC AGGCGTACCT GGTGCGCGAG 544 
AteCGTATCCC CAACTGGGAC TTCCGAGGCA ACTTGAACTC AGAACACTAC AGCGGAGACG 604 
T CCACCCGGTG CTTGAGGCGG GACCGAGGCG CACAGAGACC GAGGCGCATA GAGACCGAGG 664 
CACAGCCCAG CTGGGGCTAG GCCCGGTGGG AAGGAGAGCG TCGTTAATTT ATTTCTTATT 724 
GCTCCTAATT AATATTTATA TGTATTTATG TACGTCCTCC TAGGTGATGG AGATGTGTAC 784 
GTAATATTTA TTTTAACTTA TGCAAGGGTG TGAGATGTTC CCTCTGCTGT AAATGCAGGT 844 
CTCTTGGTAT TTATTGAGCT TTGTGGGACT GGTGGAAGCA GGACACCTGG AACTGCGGCA 904 
AAGTAGGAGA AGAAATGGGG AGGACTCGGG TGGGGGAGGA CGTCCCGGCT GGGATGAAGT 964 

CTGGTGGTGG GTCGTAAGTT TAGGAGGTGA CTGCATCCTC CAGCATCTCA ACTCCGTCTG 1024 

TCTACTGTGT GAGACTTCGG CGGACCATTA GGAATGAGAT CCGTGAGATC CTTCCATCTT 1084 

CTTGAAGTCG CCTTTAGGGT GGCTGCGAGG TAGAGGGTTG GGGGTTGGTG GGCTGTCACG 1144 

GAGCGACTGT CGAGATCGCC TAGTATGTTC TGTGAACACA AATAAAATTG ATTTACTGTC 1204 

AAAAAAAAAA AAAAAAAACT CGAG 1228 

(2) INFORMATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met 


Cys 


His 


Ser 


Arg 


Ser 


Cys 


His 


Pro 


Thr 


Met 


Thr 


lie 


Leu 


Gin 


Ala 


1 






5 










10 










15 




Pro 


Thr 


Pro 


Ala 


Pro 


Ser 


Thr 


He 


Pro 


Gly 


Pro 


Arg 


Arg 


Gly 


Ser 


Gly 






20 










25 










30 


Ala 




Pro 


Glu 


He 


Phe 


Thr 


Phe 


Asp 


Pro 


Leu 


Pro 


Glu 


Pro 


Ala 


Ala 


Pro 




35 








40 










45 








Ala 


Gly 


Arg 


Pro 


Ser 


Ala 


Ser 


Arg 


Gly 


His 


Arg 


Lys 


Arg 


Ser 


Arg 


Arg 




50 








55 










60 










Val 


Leu 


Tyr 


Pro 


Arg 


Val 


Val 


Arg 


Arg 


Gin 


Leu 


Pro 


Val 


Glu 


Glu 


Pro 


65 






70 










75 










80 


Asn 


Pro 


Ala 


Lys 


Arg 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Thr 


lie 


Val 


Phe 


Cys 








85 










90 










95 




Gin 


He 


Leu 


Met 


Ala 


Glu 


Glu 


Gly 


Val 


Pro 


Ala 


Pro 


Leu 


Pro 


Pro 


Glu 






100 








105 










110 




Val 


Asp 


Ala 


Pro 


Asn 


Ala 


Ala 


Ser 


Leu 


Ala 


Pro, 


Thr 


Pro 


Val 


Ser 


Pro 




115 










120 










125 


Ala 






Leu 


Glu 


Pro 


Phe 


Asn 


Leu 


Thr 


Ser 


Glu 


Pro 


Ser 


Asp 


Tyr 


Leu 


Asp 


130 










135 










140 










Leu 


Ser 


Thr 


Phe 


Leu 


Gin 


Gin 


His 


Pro 


Ala 


Ala 


Phe 











145 150 155 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3084 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1032 ♦..1736 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAATTCGGCA CGAGGMCAGG AGCTCCTTTW CTGCGTCTCC CATCATGGGG CTTAGGGTTG 60 

AGTCTTCAGG TTCTGGGGGC AGGAAGGACG GGCACTCAGG AGGCCCCCTC CCCATCCACA 120 

GCCCCTCTTT GGGAGGGGGG AAACTTGGCA ACCCGGGAGG CATGTGGATC TTTTCCTAAG 180 

CAAGATGCTG AGCTGGAAAG ATGGGGGTGT AAGGTAATGT CCCAAACTGA AACTTTGCCA 240 

GGCACTGGGA GAGGCTGTGA ACTCTTTTCT GGCTTTAGAA TTTAGGTCTA GATCCCAAAA 300 

GGCTAAGTAC CCCCTGGGGG CTAACCAGAG GCATGCCTGG GCTGAGCTGA ACCTTCTGGT 360 

GCACTGGCCC CTGGCTGACT GCTCTTCTGC AGGAAGTTGG AGGAGATTCC TGAAGTTGAT 420 

TCCTCAGGCT GGATGTCCAA GGGGGTTGGA GTTTCTGATG TCTTTCTGTC TCCCTCTCTT 480 

TTCTTTCTCT CCCTACCAGG TCCACTTCTT TCAGAGGGGC CTGCGGTGCT CTAAAAGTTC 540 

TCCTGTTAAA GTTTAGAGCA AATTGGTTAT TATTTTAAAA TCAATAAAAC TTTTAAAAGT 600 

ACTAAGACAA CTTCTAAGAG GGGAGTGGAC AGAGGGCCTG GTGGCAGCTC ACAGTTTCTT 660 

TTCTGACCTT TGGTCTCACC CACCAAGTGT CCCACCTGAG TGCCCACCTT GCCCACCTGA 720 

GGTAATGCCC TGGGGCTCCA CCAGTCCAGA TCCACAGGGC GCARCCATGT GGGAGTGGCG 780 

GCTGATTGTT ACCCAGTAGT GTTGATAGCA CATTATTCAT AACAGCCAAA GAGAGGAAGC 840 

AACCCAAATG TCCATTAGCT GATAAATGGA TAAATGAAAT ATGGTACGTC CGAAGAATGG 900 

AATATCATTC ACCCATGAAA AAGAACGAAG TCCAGCACCA AAACGTGCTA CAACATGGAT 960 

GAACTTCGAT GACTTTGTGC CACATGAAAG AAGAAGCCAG CCACAAAAGG CCATATATTG 1020 
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TATGAAATGA A ATG TCC AGA ATG GGC AAA CCC ATA GAG ACA CAA AAA TCT 
Met Ser Arg Met Gly Lys Pro lie Glu Thr Gin Lys Ser 
15 10 

CCG CCA CCT CCC TAC TCT CGG CTG TCT CCT CGC GAC GAG TAC AAG CCA 
Pro Pro Pro Pro Tyr Ser Arg Leu Ser Pro Arg Asp Glu Tyr Lys Pro 
15 20 25 

CTG GAT CTG TCC GAT TCC ACA TTG TCT TAC ACT GAA ACG GAG GCT ACC 
Leu Asp Leu Ser Asp Ser Thr Leu Ser Tyr Thr Glu Thr Glu Ala Thr 
30 35 40 45 

AAC TCC CTC ATC ACT GCT CCG GGT GAA TTC TCA GAC GCC AGC ATG TCT 
Asn Ser Leu He Thr Ala Pro Gly Glu Phe Ser Asp Ala Ser Met Ser 
50 55 60 

CCG GAC GCC ACC AAG CCG AGC CAC TGG TGC AGC GTG GCG TAC TGG GAG 
Pro Asp Ala Thr Lys Pro Ser His Trp Cys Ser Val Ala Tyr Trp Glu 
65 70 75 

CAC CGG ACG CGC GTG GGC CGC CTC TAT GCG GTG TAC GAC CAG GCC GTC 
His Arg Thr Arg Val Gly Arg Leu Tyr Ala Val Tyr Asp Gin Ala Val 
80 85 90 

AGC ATC TTC TAC GAC CTA CCT CAG GGC AGC GGC TTC TGC CTG GGC CAG 
Ser He Phe Tyr Asp Leu Pro Gin Gly Ser Gly Phe Cys Leu Gly Gin 
95 100 105 

CTC AAC CTG GAG CAG CGC AGC GAG TCG GTG CGG CGA ACG CGC AGC AAG 
Leu Asn Leu Glu Gin Arg Ser Glu Ser Val Arg Arg Thr Arg Ser Lys 
HO 115 120 125 

ATC GGC TTC GGC ATC CTG CTC AGC AAG GAG CCC GAC GGC GTG TGG GCC 
He Gly Phe Gly He Leu Leu Ser Lys Glu Pro Asp Gly Val Trp Ala 
130 135 140 

TAC AAC CGC GGC GAG CAC CCC ATC TTC GTC AAC TCC CCG ACG CTG GAC 
Tyr Asn Arg Gly Glu His Pro He Phe Val Asn Ser Pro Thr Leu Asp 
145 150 155 

GCG CCC GGC GGC CGC GCC CTG GTC GTG CGC AAG GTG CCC CCC GGC TAC 
Ala Pro Gly Gly Arg Ala Leu Val Val Arg Lys Val Pro Pro Gly Tyr 
160 165 170 

TCC ATC AAG GTG TTC GAC TTC GAG CGC TCG GGC CTG CAG CAC GCG CCC 
Ser He Lys Val Phe Asp Phe Glu Arg Ser Gly Leu Gin His Ala Pro 
175 180 185 

GAG CCC GAC GCC GCC GAC GGC CCC TAC GAC CCC AAC AGC GTC CGC ATC 
Glu Pro Asp Ala Ala Asp Gly Pro Tyr Asp Pro Asn Ser Val Arg He 
190 195 200 205 

AGC TTC GCC AAG GGC TGG GGG CCC TGC TAC TCC CGG CAG TTC ATC ACC 
Ser Phe Ala Lys Gly Trp Gly Pro Cys Tyr Ser Arg Gin Phe He Thr 
210 215 220 

TCC TGC CCC TGC TGG CTG GAG ATC CTC CTC AAC AAC CCC AGA TAGTGGCGG 
Ser Cys Pro Cys Trp Leu Glu He Leu Leu Asn Asn Pro Arg 
225 230 235 



1070 



1118 



1166 



1214 



1262 



1310 



1358 



1406 



1454 



1502 



1550 



1598 



1646 



1694 



1745 



CCCCGGCGGG 
GCCGATGCCC 
TTAATATAAA 
TTTTTACAAT 
ACGCACTTTG 
CTTCCTTACT 
CCTGGGTTTG 



AGGGGCGGGT 
AGAGACACAG 
GTTTTATATA 
GTAATTATTT 
GCTTATAATT 
TTTTATATAT 
GTGTATGGTT 



GGGAGGCCGC 
CCCCCACGGA 
TTATATGGAA 
ATGTATGGTG 
CTTTCAATAC 
ATATATAAAG 
TTTGAGATAT 



GGCCACCGCC 
CAAAACCCCC 
ATATATATTA 
CAATGTGTGT 
AGATATATTT 
AAAATGATAC 
TAATGCCCAG 



ACCTGCCGGC 
CAGATATCAT 
TACTTGTAAT 
ATATGGACAA 
TCTTTCTCTT 
AGCAGAGCTA 
ACAAAAAGCT 



CTCGAGAGGG 
CTACCTAGAT 
TATGGAGTCA 
AACAAGAAAG 
CCTCCTTCCT 
GGTGGAAAAG 
AATACCAGTC 



1805 
1865 
1925 
1985 
2045 
2105 
2165 
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ACTCGATAAT AAAGTATTCG CATTATAGTT TTTTTTAAAC TGTCTTCTTT TTACAAAGAG 2225 

GGGCAGGTAG GGCTTCAGCG GATTTCTGAC CCATCATGTA CCTTGAAACT TGACCTCAGT 2285 

TTTCAAGTTT TACTTTTATT GGATAAAGAC AGAACAAATT GAAAAGGGAG GAAAGTCACA 2345 

TTTACTCTTA AGTAAACCAG AGAAAGTTCT GTTGTTCCTT CCTGCCCATG GCTATGGGGT 2405 

GTCCAGTGGA TAGGGATGGC GGTGGGGAAA AGGAGAATAC ACTGGCCATT TATCCTGGAC 2465 

AAGCTCTTCC AGTCTGATGG AGGAGGTTCA TGCCCTAGCC TAGAAAGGCC CAGGTCCATG 2525 

ACCCCCATCT TTGAGTTATG AGCAAGCTAA AAGAAGACAC TATTTCTCAC CATTTTGTGG 2585 

AAATGGCCTG GGGAACAAAG ACTGAAATGG GCCTTGAGCC CACCTGCTAC CTTGCAGAGA 2645 

ACCATCTCGA GCCCCGTAGA TCTTTTTAGG ACCTCCACAG GSTATTTCCC ACCCCCCAGC 2705 

CAAAAATAGC TCAGAATCTG CCCATCCAGG GCTTGTATTA ATGATTTATG TAAAGGCAGA 2765 

TGGTTTATTT CTACTTTGTA AAAGGGAAAA GTTGAGGTTC TGGAAGGATA AATGATTTGC 2825 

TCATGAGACA AAATCAAGGT TAGAAGTTAC ATGGAATTGT AGGACCAGAG CCATATCATT 2885 

AGATCAGCTT TCTGAAGAAT ATTCTCCAMA AAAGAAAGTC TCCTTGGCCA GATAACTAAG 2945 

AGGAATGTTT CATTGTATAT CTTTTTTCTT GGAGATTTAT ATTAACATAT TAAGTGCTCT 3005 

GAGAAGTCCT GTGTATTATC TCTTGCTGCA TAATAAATTA TCCCCAMACT TAAAAAAAAA 3065 

AAAAAAAAAA AAACTCGAG 3084 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

/b Ser Pro Pro Pro 
15 

Leu Asp Leu 
30 



80 

lie Ph€ 
95 



Met 


Ser 


Arg 


Met 


Gly 


Lys 


Pro 


He 


Glu 


Thr 


Gin 


Lys 


Ser 


1 






5 










10 








Pro 


Tyr 


Ser 


Arg 


Leu 


Ser 


Pro 


Arg 


Asp 


Glu 


Tyr 


Lys 


Pro 






20 










25 










Ser 


Asp 


Ser 


Thr 


Leu 


Ser 


Tyr 


Thr 


Glu 


Thr 


Glu 


Ala 


Thr 




35 










40 










45 


He 


Thr 


Ala 


Pro 


Gly 


Glu 


Phe 


Ser 


Asp 


Ala 


Ser 


Met 


Ser 




50 








55 










60 


Glu 


Thr 


Lys 


Pro 


Ser 


His 


Trp 


Cys 


Ser 


Val 


Ala 


Tyr 


Trp 


65 








70 










75 




Val 


Arg 


Val 


Gly 


Arg 


Leu 


Tyr 


Ala 


Val 


Tyr 


Asp 


Gin 


Ala 






85 










90 








Tyr 


Asp 


Leu 


Pro 


Gin 


Gly 


Ser 


Gly 


Phe 


Cys 


Leu 


Gly 


Gin 




100 










105 










Glu 


Gin 


Arg 
115 


Ser 


Glu 


Ser 


Val 


Arg 
120 


Arg 


Thr 


Arg 


Ser 


Lys 
125 


Gly 


He 


Leu 


Leu 


Ser 


Lys 


Glu 


Pro 


Asp 


Gly 


Val 


Trp 


Ala 


130 










135 










140 




Gly 


Glu 


His 


Pro 


He 


Phe 


Val 


Asn 


Ser 


Pro 


Thr 


Leu 


Asp 


145 










150 










155 






Gly 


Arg 


Ala 


Leu 


Val 


Val 


Arg 


Lys 


Val 


Pro 


Pro Gly 


Tyr 






165 










170 








Val 


Phe 


Asp 


Phe 


Glu 


Arg 


Ser 


Gly 


Leu 


Gin 


His 


Ala 


Pro 






180 










185 










Ala 


Ala 


Asp 


Gly 


Pro 


Tyr 


Asp 


Pro 


Asn 


Ser 


Val 


Arg 


He 






195 








200 










205 


Lys 


Gly 


Trp 


Gly 


Pro 


Cys 


Tyr 


Ser 


Arg 


Gin 


Phe 


He 


Thr 


210 








215 










220 




Cys 


Trp 


Leu 


Glu 


He 


Leu 


Leu 


Asn 


Asn 


Pro 


Arg 






225 








230 










235 







110 



160 
He Lys 
175 



190 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTTTTTTTTT TNC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTGAGGCGTC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TGGACCGGTG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
TTTTTTTTTT TNA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
AGACGTCCAC 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ACTTCGCCAC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
TCGGACGTGA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CATTTCATTT CATACAA 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CATTTCATTT CATACAATAT ATG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CATTTCATTT CATACAATAT ATGGCCTTT 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CATTTCATTT CATACAATAT ATGGCCTTTT GTGGC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GGACATTTCA TTTCATACAA TATATGGCCT TTTGT 

(2) INFORMATION FOR SEQ ID NO: 25: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TTCATTTCAT ACAATATATG GCCTTTTGT 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TCATACAATA TATGGCCTTT TGT 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
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AATATATGGC CTTTTGT 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
CATGCGGGGC GAGGAGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CATGCGGGGC GAGGAGGCGA GGA 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CATGCGGGGC GAGGAGGCGA GGAGAAAAG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
CATGCGGGGC GAGGAGGCGA GGAGAAAAGT CGTTT 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GAACATGCGG GGCGAGGAGG CGAGGAGAAA AGTCG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

( B ) TYPE : nucleic ac id 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GCGGGGCGAG GAGGCGAGGA GAAAAGTCG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
CGAGGAGGCG AGGAGAAAAG TCG 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
GGCGAGGAGA AAAGTCG 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
CAAAGCNGNN NNNNCNGAGN AGUC 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GGUGGAGCCC CAGGGCAUUA CCUCAAAGCN GNNNNNNCNG AGNAGUCGUG GGCAAGGUGG 60 
GCACUCAGGU GGG 73 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

GUGUCUCUAU GGGUUUGCCC AAAGCNGNNN NNNCNGAGNA GUCUCUGGAC AUUUCAUUUC 60 
AUAC 64 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GGCCCUCUCG CCGUCGGGCU CCUUGCUGAG CAAAGCNGNN NNNNCNGAGN AGUCGAUGCC 60 
GAAGCCGAUC UUGCUGCGCG 80 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CGUUUGCCUG CUAAGGAGCG AACAAAGCNG NNNNNNCNGA GNAGUCGAUG UUUCUUUGUG 60 
AGUCGGGCGC CG 72 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CGCCGGACGA GCGCAGAUCG UUUGGUCCUG AACAAAGCNG NNNNNNCNGA GNAGUCCGGG 60 
GCGAGGAGGC GAGGAGAAAA GUCG 64 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GGAGUAAGGA GGGGGGGGAG ACUCUAGUUC GCAAAGCNGN NNNNNCNGAG NAGUCAGUCG 
GCUAAGGUGA UGGGGGUUGC AGCACACC 

(2) INFORMATION FOR SEQ ID NO: 43: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

( D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Tvr Thr Asp Thr Gly Lys Ala Ser Gly Asn Leu Glu Thr Lys Tyr Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
Thr Gly Lys Lys Ser Gly Lys Leu Lys Ala Ser <Tyr Lys Arg Asp 



1 



5 



10 
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